Nonparametric Methods for Online Changepoint Detection
نویسنده
چکیده
Changepoints have been extensively analysed in order to identify structural changes in time series data, typically when the data are of known parametric form. This report presents an exploration of methods to detect changepoints in a nonparametric setting, where no assumptions are made with regard to the distributional structure of the data, yet must still maintain a specified level of performance. In particular, the framework of a two-sample hypothesis testing procedure for sequential testing is developed, in which test statistics based on ranks of observations are adapted for changepoint detection. This framework is then extended to consider multiple changepoints and data streams. The characteristics and performance of the testing procedure are analysed by comparing the impact of test statistics in a range of scenarios. Under this framework, it is found that while parametric techniques tend to outperform nonparametric techniques in a Gaussian setting, nonparametric tests are a suitable alternative. In addition, it is found that tests for arbitrary distributional changes are comparable to tests designed to detect changes in location and scale. Overall, the nonparametric hypothesis testing procedure is found to perform well, and represents a logical course of action when performing changepoint analysis on data of no known distributional form, a common scenario that applies to a wide variety of real-world processes.
منابع مشابه
Two Statistical Methods for the Detection of Environmental Thresholds
A nonparametric method and a Bayesian hierarchical modelling method are proposed in this paper for the detection of environmental thresholds. The nonparametric method is based on the reduction of deviance, while the Bayesian method is based on the change in the response variable distribution parameters. Both methods are tested using macroinvertebrate composition data from a mesocosm experiment ...
متن کاملBayesian Online Changepoint Detection
Changepoints are abrupt variations in the generative parameters of a data sequence. Online detection of changepoints is useful in modelling and prediction of time series in application areas such as finance, biometrics, and robotics. While frequentist methods have yielded online filtering and prediction techniques, most Bayesian papers have focused on the retrospective segmentation problem. Her...
متن کاملCHAMP: Changepoint Detection Using Approximate Model Parameters
We introduce CHAMP, an algorithm for online Bayesian changepoint detection in settings where it is difficult or undesirable to integrate over the parameters of candidate models. Rather than requiring integration of the parameters of candidate models as in several other Bayesian approaches, we require only the ability to fit model parameters to data segments. This approach greatly simplifies the...
متن کاملDistribution-Free Changepoint Detection for Nonlinear Profiles
We consider changepoint detection in a process in which the observed points are profiles: large sets of functionally related points (x, y). Few changepoint detection methods have been proposed that don’t rely in some capacity on the assumption that the observational errors are normally distributed. In this paper, a nonparametric distribution-free wavelet method is proposed for monitoring for ch...
متن کاملTHE UNIVERSITY OF BRITISH COLUMBIA DEPARTMENT OF STATISTICS TECHNICAL REPORT #236 On-line Changepoint Detection and Parameter Estimation for Genome-wide Transcript Analysis
We consider the problem of identifying novel RNA transcripts using tiling arrays. Standard approaches to this problem rely on the calculation of a sliding window statistic or on simple changepoint models. These methods suffer from several drawbacks including the need to determine a threshold to label transcript regions and/or specify the number of transcripts. In this paper, we propose a Bayesi...
متن کامل